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U nderstanding the eoneept of the sampling distribution of a statistie is 
essential for the understanding of inferential proeedures. 
Unfortunately, this topie proves to be a stumbling bloek for students in 
introduetory statisties elasses. In efforts to aid students in their under- 
standing of this eoneept, alternatives to a leeture-based mode of instruetion 
have been introdueed in the literature with some of these approaehes using 
in-elass aetivities, simulations using statistieal software, and web-based 
applets. In this paper, the use of statistieal software, for the purpose of 
illustrating sampling distributions, is revisited through the use of Minitab 
maeros, an approaeh that has not been observed in the literature. The 
result is a user-friendly way for students in introduetory statisties elasses 
to explore the eoneept of the sampling distribution of a statistie. Although 
the foeus of this artiele will be on the sampling distribution, the methods 
deseribed in this paper are applieable to instruetion of other statistieal 
eoneepts. These inelude eonfidenee intervals and power. 


Introduction 

In the statisties edueation literature related to introduetory statisties 
instruetion, many researehers advoeate the use of eomputer simulation as 
a means of teaehing diffieult and abstraet eoneepts, e.g. the sampling distri- 
bution of a statistie (see for example Shibli 1990; Prybutok, Bajgier, and 
Atkinson 1991). It ean be argued that the benefits derived from using this 
approaeh ean be explained using eonstruetivist theory (Mills, 2002). 
Construetlvist theory suggests that one does not aequire new knowledge 
through passive means, e.g. reading a textbook, listening to a leeture, and 
memorising faets (Von Glasersfeld, 1987). For one to aequire new knowl- 
edge, one must internet with the new knowledge. Taking the eonstruetivist 
viewpoint, it ean be argued that statisties students benefit from the simu- 
lations by being able to eonstruet their own knowledge of a statistieal 
eoneept through the simulation experienee. 

It is important, however, that eomputer simulations are implemented in 
sueh a way as to not interfere with the learning that might otherwise take 
plaee. Today, a popular means of implementing simulations is through the 
World Wide Web in whieh students go through an interaetive program to 
generate the simulations. Sueh programs are indeed useful in that they are 
user-friendly and provide a elear visual presentation of eoneepts. One draw- 
baek of many of these interaetive programs is that, in using the program. 
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the student do not see many aspeets of the simulation on sereen, e.g. the 
aetual population values, the observations in eaeh random sample and the 
mean for eaeh sample. With applets illustrating the sampling distribution 
for a statistie, for example, students generally only see the parent popula- 
tion distribution and the sampling distribution of the statistie. Being 
distaneed from the details of the simulation in this manner takes away from 
the learning of the eoneepts. 

In this paper, an attraetive alternative to interaetive web programs is 
through the use of Minitab (2004) statistieal software to perform and 
display results from simulations. The idea of using Minitab, for the purpose 
of teaehing abstraet eoneepts through simulations, is not a new one. The 
purpose of this paper is to illustrate a very user-friendly approaeh to imple- 
menting simulations in Minitab, an approaeh that has not been observed in 
the literature. Using the approaeh deseribed in this paper, students are able 
to see the details of the simulations on sereen, in a spreadsheet. They ean 
see the aetual values in the population, the values in eaeh random sample, 
and the resulting sample statisties obtained from the different samples. 
Using this approaeh enables students to move eloser to the details of the 
simulation thus providing a better understanding of the eoneepts. 


Running simulation macro in Minitab 


In an introduetory statisties eourse, students are exposed to numerous 
statistieal inferenee proeedures. To have any understanding of these proee- 
dures, it is essential that students have a good understanding of the 
eoneept of the sampling distribution of a statistie. To aid in the under- 
standing of this eoneept, simulations are often used. Following is an 
illustration of how simulations ean be easily implemented in Minitab. 

The purpose of the simulation below is to illustrate the sampling distri- 
bution of the sample mean. Random samples of size n = 5 are seleeted (with 
replaeement) from a population eonsisting of values the authors have asso- 
eiated with the eards in a standard deek of 52 eards. For eaeh randomly 
seleeted sample, the sample mean is ealeulated and results are displayed in 
a Minitab worksheet. 



Figure 1. Population data entered into Minitab. 


Step 1 

In the data window in 
Minitab, enter population 
data as illustrated in Figure 1 . 

At this point, the student 
ean go through the menu 
options in Minitab to ealeu- 
late the mean and standard 
deviation, as well as 
eonstruet a histogram to 
examine the shape of the 
population distribution. 
These steps eould be ineor- 
porated into the maero but 
it is preferable to have the 
students go through these 
steps on their own. For the 
example population, the dis- 
tribution is diserete uniform. 
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step 2 

Go to the session window in Minitab and enable the eommand line editor. 
This ean be done by elieking editor and then enable eommands in the menu 
options. When this is done, the eommand prompt, “MTB>”, will appear in 
the session window . 


Step 3 

Run the simulation maeros. These maeros ean be found in the Appendix. 
ASCII format and a .mtb extension in the file name were used for the maero 
files. We use these maeros only for the purpose of illustrating statistieal 
eoneepts, not for teaehing students how to write a program in Minitab. As 
sueh, students are not required to write their own maero, only run the 
simulation program provided to them. However, it would be appropriate to 
diseuss with the elass what aetions are being implemented by the maero. 

The maero is implemented by going through the following sequenee of 
menu options in Minitab. 

1 . File 

2. other files 

3. run an exee 

4. open the simulation program and exeeute the program 1 time. 

The user is asked to speelfy (in the session window) the sample size and 
how many samples to seleet 


from the population as illus- 
trated below. As shown in 
Figure 2, 500 samples, eaeh 
of size 5, are seleeted and 
the mean is ealeulated for 
eaeh sample. 

The interaetive proeess 
above serves two important 
funetions. One, having 
students speeity the sample 
size and the number of 
samples reinforees what is 
being done in the simula- 
tion. Two, the interaetive 
proeess allows students to 
rerun easily the simulation 
for different sample sizes, 
preventing diffieulties in the 
implementation of the simu- 
lation from interfering with 
the learning of the eoneepts. 
Step 4 

Observe the simulation 
results. The results of the 
simulation ean be seen by 
going to the data window as 
shown in Figure 3. 

The mean and standard 
deviation of the sample 
means ean now be ealeu- 
lated and a histogram ean 
be eonstrueted to illustrate 
the properties of the 



Figure 2, Simulation initiated in Minitab session window. 
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Figure 3, Simulation results displayed in data window. 
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sampling distribution of the 
sample mean. This is easily 
done through using the 
menu options in Minitab, 
with the results of one simu- 
lation shown in Figure 4. 


Sampling 

distribution activity 

For students to reap the 
benefits of the simulation 
aetivity presented in this 
paper, it is important that 
the simulation is aeeompa- 
nied with questions related 
to the simulation, and it is 
important that these ques- 
tions are graded. As stated 


Histogram of Sample Mean 
Normal 


Mean 

8.017 

StDev 

1,726 

N 

500 



Figure 4, Approximation of the sampilng distribution of the 
sampie mean based on 500 sampies (n=5) from the 
popuiation of card vaiues. 


in Garfield’s ten prineiples of learning statisties (Garfield, 1995), Students 
learn to value what they know will be assessed. In an introduetory statis- 
ties elass at Washington State University, the simulation is made part of a 
graded assignment. Eaeh week in this elass, students are required to attend 


Lab activity 

Purpose of the Lab 

To reinforee the eoneept of 
sampling distribution of the 
sample mean through the use 
of a Minitab simulation. 

Steps 

1. Enter the values for the 
population (the eard values 
for a standard deek of 52 
eards) into Minitab. 

2. Look at the distribution for 
the population of eard 
values from whieh you will 
be seleeting random 
samples. Deseribe the 
shape of the distribution. 
Caleulate the mean and 
standard deviation for the 
population. 

3. Using the Minitab simula- 
tion maero, draw 500 
random samples, eaeh of 
size 5, from the population 
of eards values and ealeu- 


late the mean for eaeh 
sample. Construe! and 
examine the distribution of 
the sample means. This is 
an approximation of the 
sampling distribution of x 
when n=5, i.e. the distribu- 
tion of values taken by x in 
all possible samples of the 
same size (n=5) from the 
same population (the 
population of eard values). 

4. Looking at the individual 
eard values in the parent 
population and looking at 
the sample means 
obtained from the different 
samples, whieh seem to 
vary more? 

5. From what we learned 
about the sampling distrib- 
ution of X in elass, the 
mean of the sampling 
distribution of x you 
obtained should be elose to 
p = 8, the mean for the 
population of eard values. 


= 1.673 


Also, the standard devia- 
tion for the sample means 
you obtained should be 

elose to 

g _ 3.742 

^^n Vs 

the standard deviation of 
the original population 
divided by the square root 
of the sample size. 
Caleulate the mean and 
standard deviation for the 
sample means and see if 
this is the ease. 

In elass, we learned that if 
the original population 
distribution is diserete 
uniform, the sampling 
distribution of x, for n=5, 
will look approximately 
normal. Is this what you 
observed in your simula- 
tion? 
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three one-hour leetures as well as a two-hour lab session. The assignment 
is eompleted in one of the lab sessions. The following is an assignment one 
eould eonsider using in a statisties lab session. It is assumed in the 
following lab that the steps involved in rurming the Minitab simulation 
(presented earlier) have already been introdueed to the students. 


Conclusion 

In teaehing introduetory statisties eourses, the eoneept of the sampling 
distribution of a statistie proves to be one of the most diffieult for students. 
Yet, this eoneept is fundamental to understanding statistieal inferenee. The 
approaeh presented in this paper is an effeetive way of approaehing the 
teaehing of this diffieult and abstraet eoneept to introduetory students. The 
following are several reasons we provide for the sueeess observed from 
using this approaeh. 

1. Construetivist theory states that students learn best, not through 
passive means as in a leeture, but through aetive means when they 
are able to interaet with the eoneept and eonstruet their own knowl- 
edge. Going through the simulation in Minitab enables students to 
eonstruet their own knowledge of the sampling distribution of a 
statistie as opposed to having the eoneept passed on to them through 
leeture. 

2. Students are sampling from a population for whieh they are familiar. 
Almost everyone is familiar with a deek of eards. This is very different 
from telling students that they are sampling from, for example, a 
normal distribution with p=5 and CT=1, as is often seen in applets and 
in textbooks. 

3. Students are not distaneed from the simulation proeess. With the 
approaeh presented in this paper, they see the aetual population 
values of the parent distribution, the values in eaeh sample seleeted, 
and the sample means ealeulated from eaeh sample. Seeing all of 
these details is important for students in introduetory statisties 
eourses. 

4. It is very easy to implement the approaeh. This is important beeause 
if students have to struggle to implement the approaeh, this will inter- 
fere with the learning that takes plaee. 

In this paper, the foeus has been on demonstrating the use of a Minitab 
maero for the purpose of illustrating the eoneept of the sampling distribu- 
tion of the sample mean. There are also benefits from using Minitab maeros 
for illustrating other statistieal eoneepts, e.g. eonfidenee intervals and 
power. 

One issue is the length of time it takes for the maero to run. This time 
ean vary eonsiderably depending on the speed of the maehine being used. 
Given this, it is important to perform a test run of the maero before using 
it in a elass. For example, it took 25 seeonds for the simulation to run on 
the eomputer used by the authors. This is longer than it takes for web 
applets to run, whieh return results instantaneously. However, given the 
benefits to students from utilising the Minitab maeros, this inerease in the 
amount of time for the simulation to run is eompensated by an inereased 
understanding of the eoneept of the sampling distribution of a statistie. 

Many approaehes have been suggested in the literature for teaehing the 
eoneept of sampling distribution of statistie. The approaeh presented in this 
paper is another to eonsider. 
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Appendix: Minitab macros 

PRIMARY.MTB: 

FILE "TERMINAL"; 

NOBS I. 

Note 

Note Enter the Number of Samples to be Seleeted 

Note 

Set e99; 

FILE "TERMINAL"; 

NOBS 1. 
let kl = elOO(I) 
let k2 = e99(l) 

Name e4 "Sample" 

Name e5 "Observation" 

Name e6 "Sample Data" 

Name e7 "Sample Mean" 
set el 04 
kl(l) 
end 

let el02 = k2*el04 
Exeeute "NestedExee.mtb" k2. 

Erase e99-el04 
Erase e3 

NestedExec.mtb : 

Sample kl 02 03; 

Replaee. 
set el 03 
(l:kl) 
end 

staek el 02 e4 e4 
staek el 03 e5 e5 
staek e3 e6 e6 
Let K3 = Mean(03) 

Let OlOl(kl) = k3 
staek elOl c7 c7 
let k2 = k2 - 1 
let el02 = k2*el04 
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